A Pulse Model in Log-domain for a Uniform Synthesizer
نویسندگان
چکیده
The quality of the vocoder plays a crucial role in the performance of parametric speech synthesis systems. In order to improve the vocoder quality, it is necessary to reconstruct as much of the perceived components of the speech signal as possible. In this paper, we first show that the noise component is currently not accurately modelled in the widely used STRAIGHT vocoder, thus, limiting the voice range that can be covered and also limiting the overall quality. In order to motivate a new, alternative, approach to this issue, we present a new synthesizer, which uses a uniform representation for voiced and unvoiced segments. This synthesizer has also the advantage of using a simple signal model compared to other approaches, thus offering a convenient and controlled alternative for future developments. Experiments analysing the synthesis quality of the noise component shows improved speech reconstruction using the suggested synthesizer compared to STRAIGHT. Additionally an experiment about analysis/resynthesis shows that the suggested synthesizer solves some of the issues of another uniform vocoder, Harmonic Model plus Phase Distortion (HMPD). In text-to-speech synthesis, it outperforms HMPD and exhibits a similar, or only slightly worse, quality to STRAIGHT’s quality, which is encouraging for a new vocoding approach.
منابع مشابه
A Study of Electromagnetic Radiation from Monopole Antennas on Spherical-Lossy Earth Using the Finite-Difference Time-Domain Method
Radiation from monopole antennas on spherical-lossy earth is analyzed by the finitedifference time-domain (FDTD) method in spherical coordinates. A novel generalized perfectly matched layer (PML) has been developed for the truncation of the lossy soil. For having an accurate modeling with less memory requirements, an efficient "non-uniform" mesh generation scheme is used. Also in each time step...
متن کاملFDTD Analysis of Top-Hat Monopole Antennas Loaded with Radially Layered Dielectric
Top-hat monopole antennas loaded with radially layered dielectric are analyzed using the finite-difference time-domain (FDTD) method. Unlike the mode-matching method (MMM) (which was previously used for analyzing these antennas) the FDTD method enables us to study such structures accurately and easily. Using this method, results can be obtained in a wide frequency band by performing only one ti...
متن کاملSolute Transport for Pulse Type Input Point Source along Temporally and Spatially Dependent Flow
In the present study, analytical solutions are obtained for two-dimensional advection dispersion equation for conservative solute transport in a semi-infinite heterogeneous porous medium with pulse type input point source of uniform nature. The change in dispersion parameter due to heterogeneity is considered as linear multiple of spatially dependent function and seepage velocity whereas seepag...
متن کاملFew-cycle femtosecond field synthesizer.
We report on an optical field synthesizer consisting of a CEO-phase stabilized octave-spanning Ti:sapphire laser oscillator, a double-LCD prism-based pulse shaper, and a SPIDER pulse characterization apparatus. This field synthesizer allows for generating pulses with durations as short as 3.6 fs and enables to control the electric field on a sub-cycle scale. Within the limits of the ultrabroad ...
متن کاملSolute Transport for Pulse Type Input Point Source along Temporally and Spatially Dependent Flow
In the present study, analytical solutions are obtained for two-dimensional advection dispersion equation for conservative solute transport in a semi-infinite heterogeneous porous medium with pulse type input point source of uniform nature. The change in dispersion parameter due to heterogeneity is considered as linear multiple of spatially dependent function and seepage velocity whereas seepag...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016